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METHOD AND APARATUS FOR CHARACTERIZING 
THE QUALITY OF A NETWORK PATH 

Field 

The invention relates generally to paths of one or more computer 
5 networks, and particularly relates to metrics characterizing the network paths. 

Background 

The use of efficient routing algorithms, such as the bellman-ford 
algorithm and the open shortest path (OSP) algorithm is highly desirable in 
complex networks. Until today, these algorithms use metrics that are mostly 
static, and that are based on very coarse approximations of path performance 
quality (e.g., number of hops, user-defined static costs that are associated to 
each of the links in the network). As the Internet becomes more and more 
ubiquitous, metrics that characterize the quality of network applications running 
across network paths become more important. Metrics for voice and video have 
been devised in the past; however these metrics are complex and not adapted for 
routing, and instead can be used to report performance at select points in the 
network. 

Summary 

N'letrics that at the same time are (1) additive and (2) characterize the 
20 performance of network applications are highly desirable,, as they allow routing 
algorithms and devices that can take into account factors that matter to the users 
of these applications. 

We describe multiple methods and apparatuses for characterizing the 
quality of a network path by means of a metric. This metric characterizes a 
25 plurality of one or more network applications running across a network path.. 

The quality characterization characterizes a quality of the same plurality of one 
or more network applications running at one or more end-points of the path. 
This metric is at least a function of a plurality of one or more elementary 
network parameters. The plurality of one or more network parameters include 
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one or more of delay, jitter, loss, currently available bandwidth, and intrinsic 
bandwidth. 

This metric is also additive, in the sense that given a network path, that 
includes a first segment and a second segment, the metric score across the 
network path is equal to the sum of the metric score across the first segment, 
and the metric score across the second segment. 

Additional features, benefits, and embodiments of the present invention 
w,ll become apparent from the detailed description, figures and claims. 



Brief Description of the Drawings 

Fig . 1 voice traffic: performance degradation (MOS - MOS,„ h ,)/(MO Sl 
- MOS,,,,,,) of some embodiments versus (a) percentage speech clipped and (J 
RTT. 1 J 

Fig. 2 short TCP connections: performance degradation 
(Latency min ,Latency) of some embodiments versus (a) the loss rate and (b) RTT. 
Fig. 3 typical file transfer: performance degradation 

iThrovghpui/nraugHpu,^) of some embodiments versus (a) the loss rate and 
(b) RTT. 

Fig. 4 ^/OS contours of some embodiments versus RTT and % speech 



lost. 

Fig. 5 normalized MOS degradation of some embodiments versus (a) 
speech lost and (b) one-way delay with negative exponentials. 

Fig. 6 shows an embodiment of approximating Latency,, J Latency of 
some embodiments versus (a) one-way delay and (2) loss rate with negative 
exponentials. s 

Fig. 7 shows an embodiment of approximating L m „ ch ,„j Lalmcy of 
some embodiments versus (a) one-way delay and (2, packet loss w itl , negative 

exponentials. 

Fig- 8 approximating ThroushputlThrou^hpm^ of some embodiments 
versus (a) one-way delay and (2) packet loss with negative exponentials. 
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Fig. 9 approximating T/voughput/Throughpui max of some embodiments 
versus (a) one-way delay and (2) packet loss with negative exponentials. 

Fig. 10 series of mappings involved in the translation of raw 
measurement traces into a web transaction score of some embodiments. 

5 Fig. 1 1 dynamics of a Simple TCP connection of some embodiments. 

Fig. 12 cwnd versus time for a TCP connection over of some 
embodiments (a) a high bandwidth path and (a) a low bandwidth path. 

Fig. 13 download of a web page which consists of one html file and nine 
images using HTTP/1 .0, with the Keep-Alive option either set or not of some 
10 embodiments. 

Fig. 14 download of a web page, which consists of one html file and 
nine images using HTTP/1.1, using either persistent connections, or persistent 
and pipelined connections of some embodiments. 

Fig. 1 5 latency of a typical web transaction for different file sizes, given 
1 5 various loss rates versus one-way delay, using either HTTP/1 .0 or HTTP/1 .1 of 
some embodiments. 

Fig. 16 typical web transaction for different file sizes, given various 
round-trip times versus the loss rate, using either HTTP/1 .0 or HTTP/1.1 of 
some embodiments. 

20 Fig. 17 latency of a typical web transaction for different file sizes, given 

various round-trip times versus one-way jitter, using either HTTP/1 .0 or 

HTTP/1 . 1 of some embodiments. 

Fig. 1 8 user's impression versus web transaction duration A. Bouch, N. 

BhattL and A. J. Kuchinsky, Quality is in Ihe eye of the beholder: Meeting 
25 users' requirements for Internet Quality of Service. To be presented at 

CHI'2000. The Hague, The Netherlands. April 1-6, 2000, pp. 297-304.and A. 

Bouch, N. Bhatti, and A. J. Kuchinsky, Integrating User-Perceived Quality into 

Web Server Design, HP Labs Technical Report HPL-2000-3 20000121 of some 

embodiments. 

30 Fig. 19 impermissible rate (%) versus MOS, N. Kitawaki and K. Itoh. 

Pure Delay Effects on Speech Quality in Telecommunications IEEE Journal of 
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Selected Areas in Communications, Vol. 9, No. 4. May 1 99 1 of some 
embodiments. 

Fig. 20 metric of a typical web transaction for different file sizes., given 
various loss rates versus one-way delay, using either HTTP/1 .0 or HTTP/1 . 1 of 
some embodiments. 

Fig. 21 metric of a typical web transaction for different file sizes, given 
various round-trip times versus the loss rate, using either HTTP/I .0 or 
HTTP/] .] of some embodiments. 

Fig. 22 metric of a typical web transaction for different file sizes, given 
various round-trip times versus one-way jitter, using either HTTP/1 .0 or 
HTTP/I. 1 of some embodiments. 

Fig. 23 network device of some embodiments deployed in a network. 

Fig. 24 network device of some embodiments deployed in an 
internetwork, at a point of presence between one or more networks. 

Fig. 25 network device of some embodiments used for routing deployed 
in an internetwork. 

Fig. 26 network device of some embodiments used for routing, deployed 
in an mternetvvork, at a point of presence between one or more networks. 

Fig. 27 shows some possible embodiments with devices that are 
communicating with each other, for example sending and receiving 
measurement packets. 

Fig, 28 shows one specific detailed embodiment with two devices 
where each device is sending and receiving measurement packets as well Is 
selecting a subset of paths. 

Fig. 29 shows an embodiment with more than two devices that are 
sending and receiving measurement packets to obtain measurements of 
performance characteristics of paths and to communicate measurements 
statistics about those paths. 
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Detailed Description 

We describe multiple embodiments of a method that translates 
elementary network parameters, wherein the plurality of one or more network 
parameters include one or more of delay, jitter, loss, currently available 
bandwidth, and intrinsic bandwidth into a metric that measures, at least in part 
quality characterizations of a same plurality of one or more network 
applications, wherein the quality characterization characterizes a quality of the 
same plurality of one or more network applications user-perceived quality 
metrics. In some embodiments of this invention, this metric captures the user- 
perceived experience of the performance of an application. This metric is 
additive, in the sense that given: 

a network path, including a first segment and a second segment; 

a first metric and the second metric, which are at least in part 
quality characterizations of a same plurality of one or more 
network applications, wherein the quality characterization 
characterizes a quality of the same plurality of one or more 
network applications running at one or more segment end-points 

wherein the first metric and the second metric are at leasi partly a 
function of a same plurality of one or more elementary network 
parameters, wherein the plurality of one or more network 
parameters include one or more of delay, jitter, loss, currently . 
available bandwidth, and intrinsic bandwidth 

wherein the first metric is at least partly the function of the same 
plurality of elementary network parameters of the first segment, 
wherein the one or more segment end points include one or more 
end-points of the first segment 

wherein the second metric is at least partly the function of the 
same plurality of elementary network parameters of the second 
segment, wherein the one or more segment end points include 
one or more end-points of the second segment. 
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adding the first metric and the second metric generates a third metric, wherein 
- the third metric is at least partly the function of the same 
Plurality of one or more elementary network parameters of the 
network path, wherein the one or more segment end points 
include one or more end-points of the network path 

- the third metric is a quality characterization of a same plurality 

of one or more applications 

In the pr „ cess of designing suoh , ^ ^ ^ ^ 
dnves the genera, methodotogy, as „eH as the methodoiogy specifics for voice 
video, TCP traffic, HTTP/I .0, and HTTP/1 . l respective,, Appl y ina the ' 
described technioue for voice, video, and data traffic, „e derive events of 
-oh a„ apphcation specific metric for each type of appiication independent,, 

General methodology 

and ,„ Le '' S a ' " K 8e " eral ° f ^~ <W<*>» versus deiav 
and o SS curves for vo , e Md Tcp ^ wtere 

performance metric that matters for the respective appiicatton: MOS score for 
vo.ce traffic, throughput and latency for TCP data traffic. (See Fi.s , , 

<n some embodiments of this i„ve„, 10 n, these curves are „ om ,aii Z ed 
Normahzmg the performance curves inc.udes transiting them to a 0- , " 
performance score, where i represents „o degradat.on, whereas 0 represents 
-a, de g rada„o„. Other embodiments can use other scaies such as i ve« 
ca,s from ,-0 and/or different scates such as 0-,0. Por exampie, in sonT 
-bo ,me„ts of this invention, in one embodiment of the vo.ee appiicatio a 
MOS (Mean Opinion Score) of 4 is converted to a metric of , forToice 21 4 

^ ™ * — * ■ system), whereas a MOS 

core of maps to a .netric of 0 (that is, tota, de S rada„o„). On the other hand in 
so,™ embod.ments of this invention, f or ge „e ra , TCP appiications i, is 
assumed that a metr.c of , for short TCP transactions transiates into a fas, 
-ha .e response, whereas a metric of 0 represents "infinite,.. sl „ w respor.se 
F-l,, ,„ some embodiments ofthis invention, for TCP flic transfer a metric 
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of 1 corresponds to a high throughput, while a metric of 0 corresponds to a 
throughput that is close to 0. Note that in the context of TCP, a loss rate, of 0 is a 
physically plausible phenomenon that gives the benchmark performance in 
terms of latency and throughput to which the performance achieved over lossy 
5 paths can be compared to. On the other hand, for some embodiments of this 
invention, a round-trip time of zero is less realistic., and leads in one 
embodiment with TCP traffic to both infinite throughput and zero latency as 
long as the loss rate is less than 100%. Hence, in some embodiments of this 
invention, the round-trip time for which the benchmark latency and throughput 

1 0 measures are computed, RTTo is larger than 0 ? and can be chosen according to 
the characteristics of the particular system in consideration. In some 
embodiments, the internetwork considered includes national networks spanning 
large geographical areas; in this context a one-way delay of less than 25 ms is 
uncommon. Hence, in such embodiments, a RTTo — 50 ms can be appropriate. 

15 In some embodiments, the choice of /?7To is significant, since different values 
of RTTo lead to different shapes for the curves in Figure 2b and Figure 3b, 
which in turn lead to different metric parameters. 

The shape these curves is very similar between many embodiments. In 
some embodiments, the curves corresponding to voice (Fig. 1) have a shape that 

20 can be approximated by a negative exponential. In some embodiments, the 
curves corresponding to TCP applications (Figs. 2-3) are hyperbolic in p and 
RTT: in some embodiments of this invention, a hyperbolic function can be 
approximated, for a portion of the parameters' ranges with an exponential 
function. The related issues are dealt with later in the appendix, in the section 

25 reserved for TCP applications. 

In some embodiments, it is assumed that these curves can be fitted by 
negative exponential functions in some portion of the parameters' ranges. In 
some embodiments, in both the case of voice and data traffic, it is also assumed 
that one-way delay is half the round-trip time delay, so performance degradation 

30 versus one-way delay curves can be obtained. The same theory can apply to 
embodiments of voice and/or TCP traffic. For simplicity, in the remainder of 
this section, we describe an embodiment in the context of voice traffic. Hence, 
in this embodiment, the following equations estimate performance degradation 
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versus one-way delay and loss, m vd and m vl , respectively., corresponding to 
voice traffic: 
m rl) = eX p(-a,.D) 
/??,., = exp(- /?,./) 

Delay-loss MOS contours have not, to the extent of our knowledge, been 
studied extensively in previous subjective studies of voice quality. Hence, in 
some embodiments, assumptions are made in the way a metric m v combining 
m vD and ,„ v/ can be obtained from both metrics' equations. In one embodiment, 
one intuitively appealing technique that can be used to combine m vD and m vl can 
be used. Given the equations above, performance is close to perfect when both 
m vD and m v , are close to 1. That is, we have 
(/"„£> = 1 and m v/ = ]) => Wl , = j 

1 5 On the other hand, note that if either ,„ vD or ,„„ are close to 0, then the quality as 
perceived by the user will be bad, i.e., ,„ v must be close to 0. Hence, the second 
relation between m vD , m v , and /;?,, ought to be 
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(>»vd = 0 or m vl = 0) => m v = 0 

One operator that satisfies both relations above is x. In this embodiment, we use 
this operator; the resulting metric for voice becomes 

»h =^x>n, =exp(-a 1 .D)xexp(-/?,./) = exp(-a,£.-A./) 

According to the obtained metric, equal metric contours in the ( A /.) plane are 
straight lines (See Fig. 4.) 

Note that since the exponential function is monotonic, the metric is also 
30 additive, in the sense described above, (which we denote MS in this document) 
is equal to 
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^ J3 V , 
metric = D + — / 



Defining 5 V to be equal to p./ccv, we get 



metric = D + 5,/ 

5 

In this embodiment, the metric has two properties: on one hand, it can be 
simply modified using a single parameter 5 V . (Other embodiments can use more 
than one parameter.) On the other hand, it is additive, in the sense described 
above. This characteristic can be useful in the context of routing; specifically, in 

1 0 some embodiments., this metric can be used for routing, using algorithms such 
as Distance Vector and SPF (Shortest Path First). That is, using such a metric, 
dogleg routing can be easily implemented. In the following, we justify this 
property in the context of this specific embodiment: assume a path includes two 
links, L\ and Zo- The one-way delay and loss across link L\ are denoted D\ and 

1 5 /(. while the delay and loss across link L2 are denoted D2 and /?, respectively. 
We also define metric\ and metric-* to be the resulting metrics on link L\ and 
link £2, respectively, while the total metric metric is set to meiric\ + metici = D\ 
tD 2 + Sr (/1 + /2). In this particular example, we use the one-way delay equation 

D. = d. +4v. r / = l s 2 

20 Clearly, D = D\ + £) 2 is only an approximation of the delay on the link 

since in some embodiments, jitter is not additive. However, in some 
embodiments, the total jitter across a path is smaller or equal to the sum of jitter 
values for all segments on the path. Hence, in these embodiments, D\ + D 2 is a 
higher bound on the actual delay D over the link. In other words, setting the 

25 delay for the total path to D\ + D2.- a 2 hop path is effectively penalized over a 
one hop path. In some embodiments, this is intuitively justifiable, since using 
two hops as opposed to one hop exhibits potentially extra overhead which is not 
taken into account by the delay metric, such as the potential additional delay 
incurred at the node connecting the two hops, and other reliability issues in 

30 terms of routing and connectivity. Hence, in such embodiments, it appears 
reasonable to penalize routes with a higher hop count 



o 



On the same line of thought, let's derive the total loss rate on the path 1 
some embodiments, the losses on Links I, and L 2 can be assumed to occur 
independently; in such embodiments, then the total loss can be found using 

!-/=(! -/.)(I-/ 2 ) 
/ = /, +/ 2 -/,/ 2 

That is, the actual loss 0n the path is lower than the value /, + / 2 assumed bv the 
addmve nature of the metric. In fact, in embodiments where the losses on Hnks 
Li and L 2 are correlated, the actual loss is generally lower than /,+/,. /,/, 
However., first note that for the range of loss rates of interest to some 
embodiments, (say 1-1 0%), ,,, 2 is much smaller ^ /( „ ^ (fcr ^ 
= 10% and / 2 - l 0% , th e„ ,,,, is a mere ]%y ^ jn ^ ^ 
can safely be ignored from the equation for /. Furthermore, in some 
embodiments, the same argument can be set forth, as was.made for the 
computation of delay, namely that penali 21ng the route that has a laraer hop 
count is justifiable from a design point of view. 

The arguments described above demonstrate the adequacv and 
practicality of the metric metric = D + 5/ for some embodiments. In the sections 
below, the theory described above, which applies to some embodiments of this 
invention, is used in the specific contexts of voice and TCP data traffic 
respectively, leading to some embodiments of tins invention for voice and TCP 
In some embodiments, it ,s useful to have a value of 5 that is adequate both for 
voice and TCP. In some embodiments the value of 6 is common for both voice 
and data traffic. 



Metric for voice traffic 

1" *is section, we approximate the normalized delation curve, 
shown in Fig. , with negative exponent:*. The results are shown in Fi, s 
Focus fc, firs, on Figure 5a, we can see ,ba, „,,i„ s „ K cur ve „,„„ an e.Joncnt.a, 
ieads to e.ther an under-es.ima.ion of the de e rada,,on for ,ow speech , oss (,l, a , 
■s, less than 4* loss rates) or ,o an over-es,im a ,,o„ of the delation for higb 
loss rates (e.g., I OS loss rates and above,. ,„ order to pick an appropriate fi, 
one ntus, remember that the curve shown in Fig. j. ^sumes no error resi.iencv 



10 



WO 02/33896 



PCT/US01/32476 



at all. That is, a lost packet is replaced with silence. However, most modern 
voice decoders have features that enable some sort of error resiliency, in such a 
way that the effect of small clips can be significantly mitigated. As a result, one 
might expect that in actual modern voice encoders, the jVJOS degradation 
corresponding to low speecli lost is not as high as that shown in Figs, la and 
Fig. 5a. In addition, it is clear that the efficiency of such error-resilient decoders 
gets lower as the packet loss increases; for loss rates exceeding 10%, no error 
correction can overcome the degradation caused by the amount of information 
lost. Hence, as shown in Fig. 5a, we fit the degradation curves with negative 
exponentials that under-estimate the degradation for low speech loss (that is, for 
values of / lower than 4%), and matches the degradation for values of/ 
exceeding 6%. We get 

exp(- 25/) < m rt < exp(-20/) 
with an average of 

m v /= exp(-23/) 

Similarly, we fit negative exponentials to the curves representing MOS 
degradation versus one-way delay D (obtained from the MOS versus RTT 
curves using the simple D - RTT/2 relation). (See Figure 5b.) Here too. in some 
embodiments of this invention, a choice has to be made between slightly under- 
estimating the loss for low r values of delay, and over-estimating the loss for 
higher values of delay. In some embodiments, the curves are derived from 
experiments conducted in N. Kitawaki and K. Itoh, Pure Delay Effects on 
Speech Quality in Telecommunications, IEEE Journal of Selected Areas in 
Communications, Vol. 9, No. 4, May 1991, for different tasks: 

• Task J : Take turns reading random numbers aloud as quickly as possible 

• Task 2: Take turns verifying random numbers as quickly as possible 

• Task 4: Take turns verifying city names as quickly as possible 

• Task 6; Free conversation 

In some embodiments of this invention, the experiments that involve 
intense interaction, such as business calls or transaction-related calls are the 
most relevant. For these embodiments, the metric is optimized for tasks that 

1 1 



resemble more Tasks J and 2 (from N. Kitawaki and K. Itoh, Pure Delay Effects 
on Speech Quality in Telecommunications, IEEE Journal of Selected Areas in 
Communications, Vol. 9, No.4, May 1991, then Task* 4 and 6. In addition for 
such embodiments, a round-trip time that is larger than 500ms (that is, one wav 
delays larger than 250 ms) is not desirable, as it gives a sense of poor quality ' 
that , s not well reflected by the low performance degradation obtained in N ' 
Kitawaki and fC. Itoh, Pure Delay Effects on Speech Quality in 
Telecommunications, IEEE Journal of Selected Areas in Communications Vol 
9, No.4, May 1991. Thus, for such embodiments, the chosen voice native 
exponential curves fit closely the results obtained in N. Kitawaki and K. Itoh 
Pure Delay Effects on Speech Quality in Telecommunications, IEEE Journal of 
Selected Areas in Communications, Vol. 9, No.4, May 1991, for low values of 
delay (that is, for D smaller than 250 ms), and over-estimates the MOS 
degradation for values of D that are larger than 250 ms (that is, for round-trip 
times that are larger than 500 ms). Hence, one embodiment that corresponds to 
this embodiment uses the following metric: 

exp(- 1 . 1 D) < m vD < exp(-2 .OD) 



with an average of 



m vD = exp(-I.5D) 

That is, « v = 1 .5, p\. = 23, yielding 5,. - 15. Also, using the bounds for and p r 
obtained above, the corresponding minimum and maximum values for 5„ 
become: 10 < 5,, < 23. 

Metric for video traffic 

In this section, we describe the derivation of one embodiment of a 
metric for video. In this embodiment, we use a model that is very similar to that 
used for voice. Indeed, it is assumed in this embodiment that the degradation of 
a voice conversation because of excessive delay is very similar to the 
degradation of a video communication. In applications such as video- 
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conferencing, the aura) sense (the fact that one end is listening to the other end) 
is complemented by the visual sense (that is, the fact that one end also sees the 
other end). As a result, in some embodiments, the video metric can be 
considered slightly less sensitive to delay than the voice metric. 

5 As far as loss is concerned, in some embodiments, the voice metric 

underestimates the effect of loss in video quality; indeed, in such embodiments, 
the loss of one video frame can affect a large number of frames. The actual 
number of frames affected depends on the encoding of the video sequence. In 
this embodiment, we use the concept of useful frames to derive, from the metric 

10 obtained for voice, a metric that can be applied for video. A useful frame 
denotes a frame that is successfully decoded at the receiver. In some 
embodiments, the effect of loss on the metric can be increased by taking into 
account the average number of frames lost by the encoder upon the loss of a 
frame as video traffic traverses a path (that is, in some embodiments, as video 

1 5 traverses the network). In such an embodiment, the ratio of frame loss (that is, 

those frames that the receiver is unable to decode successfully) vs. packet loss is 
used to obtain a scale factor that can be applied to the loss component of the 
voice metric function, yielding the loss component of the video metric. In this 
embodiment, the following methodology is used to derive this ratio. First, a 

20 model for frame loss along the path is assumed. In this embodiment, it is 

assumed that all frames are affected by lost independently, e.g., the loss model 
is Bernoulli. Those skilled in the art can follow the same methodology and 
derive a similar metric for video for other loss models considered. (Specifically, 
in some embodiments of this invention, it is useful to consider a loss model that 

25 is clustered, following a model of loss that is attributed to the internet.) The 

independent model can, in some embodiments, be applied to the Internet, since 
every frame is typically formed by more than one packet: that is. in such an 
embodiment, it is assumed that a cluster of packet loss occurs in a single frame, 
hence this independent assumption holds in the Internet. In some embodiment, 

30 it is assumed that a Group of Pictures (GOP) contains N frames, one 1 frame, 
and AM P frames. If an 1 frame is lost, the rest of the N -\ frames of the GOP 
are lost; if the firsi P frame is lost, the rest /V- 2 frames of the GOP arc aFi'ected. 



and so on. Therefore., in some embodiments, the scaling factor can be computed 



as: 



■S^video = (1//V) x f 1 + 2 + ... + (A'_ 1)] = (A/_ jy 2 
Hence, in some embodiments, the video model becomes: 

metric video = exp(-£» - 69/) 

That is, a„ = 1 .0, Pl . = 69, yielding 8, = 69. Note that in this embodiment, the 
value of a, is smaller for video than for voice, to take into account the fact that 
video is less sensitive to delay. Also, in this embodiment, the value of p\. is 
scaled by a factor SF v]deo = 3, which corresponds to N = 7. Those skilled in the 
art can derive metrics that use different values of «„ = 1 .0, p\, = 69, depending 
on the set of assumptions used. Also, those skilled in the art can use the 
methodology described above to derive various metrics, based on different 
parameters, depending on the assumptions. 

Metric for TCP data traffic 

In this section, we describe an embodiment of this invention that applies 
for generic TCP traffic. We find the appropriate values for a* ft, and hence 5, 
in this context. In some embodiments of this invention, the metric is applied to 
short TCP connections. In other embodiments, the metric is adapted to the 
throughput of a typical file transfer, assumed to be 75 KBytes; in yet other 
embodiments, the metric is to be adapted to the throughput of an infinite file 
transfer. Some embodiments that cover two cases: (1) the latency of short 
transactions (e.g., a buy button on some web site) and (2) the throughput of files 
that have a "typical" size of 75KBytes. Those skilled in the art can use a similar 
methodology to derive the metric for files of other sizes for other embodiments. 

Optimizing 5„ to capture the increase in latency for a short connection 

In some embodiments, the performance metric used for short 
connections is the latency incurred from the instant the user asks for the 



14 



WO 02/33896 



PCT/US01/32476 



transaction to be sent, to the time the packet containing the transaction arrives at 
the destination. Hence, I D = Laiency{Do)/Lalency(D) and // = 
Latency(Q)!La(ency([) are the corresponding normalized metrics, where / 
represents the one-way packet loss rate on the path, D represents the one-way 
delay, and Do is the minimum assumed one-way delay RTT 0 /2 = 25 ms. In one 
embodiment, we approximate the latter two measures with negative 
exponentials, yielding to m c fp and /??<//, respectively, i.e., 

m dD = e ^P("^dD) - Latency(Do)/Latency(D) 
nidi = ex P("PdO ~ Latency(0)/Latency(7) 

Note that the equation for wjd seems to lead to the following discrepancy: 
mjd(&o) , while Id(Dq) = 1 . It appears that this discrepancy could have been 
avoided by using a normalized version of w d o, mjd(D) = exp[-otc/(D-Do)] so that 
)n c / D (Do) = 1 . However, by doing so, we lose the similarity between the 
performance metrics for voice and data. Also, as will be seen in the following 
graphs, m c / D = exp(-a^D) approximates Id(Dq) quite well. Hence ; in this 
embodiment, there is no incentive in normalizing m dD in this fashion. In other 
embodiments, normalizing mjo could be warranted. 
0 We start with an approximation of I a, that involves the derivation of a single 
constant a £ /. This is done in Figure 6a. As can be seen from the figure, m llD (25 
ms) 1, as expected. Approximating Id with m cfD leads to an overestimation of 
the performance degradation for low one-way delays (that is, for D < 100 ms) 
and very large delays (that is, for D > 400 ms). However, the approximation is 
5 quite reasonable for D in the [0, 500 ms] range, that is, for RITs ranging from 0 
to one second. The corresponding value of aj is 4.5, yielding to m c/D = exp(- 
4.5Z>). Using the value ct f /= 4.5 thus obtained, we now approximate // using 
»hiD-nuu=- exp(-a f /A> - P<i0; the result is shown in Figure 6b. the approximation 
seems reasonable in this case too, leading only to a slight underestimation for 
0 very low (that is, lower than 1%) and very large (thai is, larger than 8%) loss 

rates. The resulting value of (3 f / is 16, and the corresponding //?,// becomes />?,// = 
exp(-16/). 



Therefore, i„ some embodimen(s> the c0lTespm]di va)ue 
^ 'V ■ ^ ^ * f ° r «"* ™»°«^ value „ f 5 is much 

z; t °; ;r; 7 , c ,han ror voice i,amc cm - 5 - -™ - 

equa! ,o 1 5). I„ ftcl , ,„ one embodiment ^ Tcp ^ ^ 

«-^*s~.r^r ,,M, " ,,-v '" ,,,, - b 

traffic, a I /„ loss rate is only worth a 36 ms de , av 
Converse y , a ,„„ ms dday „ Me ^ ^ 

o a .ere 0, % , 0 ss rate, whereas a ,00 m s d e,a y in onc £mbodimcil , ^ 

traffic is equivalent to a 2.78% loss rate 

where ^Tr"^ " ^ '° *»* 

here he nretnc neganve exponent, ra e„ic captures ,„e decrease in 

™» S hpu t for a connection of typi ca, me size. ,„ such emboim J ^ 

performance metric of interest is the W p«« , • 

In such embodiments , - r C °™ eCt, ° n 

represents the one-wav delav and D ■ « °" ^ * 

- „ ' day ' Do 15 the assumed one-way delay 

Kl - 2z ms. Assuming that we are able to 1nmn • , 
me™,™ -fi aiedblet0a PP'0.ximate the latter two 

measuies with negative exponentials vieldin* to m a 
must hence have ' * "° ^ ^ ^^'-V, we 



"do = exp(-a d D) * ThroughputfD)/ Throughput (D 0 ) 
m d , = expC-pV) * Throughput (/)/ Throughput (0) 
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Note that, as in the previous sub-section, the equation for m^o seems to lead to 
the following discrepancy: }hud{Dq) ^1, while //.>(£>o) = 1. It appears that this 
discrepancy could have been avoided by using a normalized version ofmjo, 
5 !i7 c / D (D) = exp[-a^Z)-Z)o)] so that m c / D (Do) = 1 . However, for the same reasons 
described in one embodiment with short TCP connections, some embodiments 
(described here) find no incentive in normalizing m^o m this fashion. (Other 
embodiments could, on the other hand, find it useful to normalize nijp in this 
fashion.). 

10 In some embodiments, an approximation of to can be found, which 

involves the derivation of a single constant a^. (See Fig. 8a.) As for short TCP 
connections, /77</d(25 ms) 1 , as expected. Also, the approximation is quite 
reasonable for D in the [0. 500 ms] range, that is, for RTTs ranging from 0 to 
one second. The corresponding value of is 6.5. yielding to nidD = exp(-6.5£>). 

15 Using the value of = 6.5 thus obtained, //can be approximated using /»</£>. 

= exp(-a</Do - pj/): the result is shown in Figure 8b. The approximation seems 
reasonable in this case too. The resulting value of P</ is 37. and the 
corresponding niji becomes m^i = exp(-37/). 

Therefore, for such embodiments, the corresponding value of 5d is 

20 37/6.5 = 5.7. That is, when these assumptions hold, the value of 5 is 

significantly higher than for short TCP connections (3.6), but still lower lower 
than for voice traffic (10 minimum, 15 on average). In fact, in one embodiment 
with TCP traffic, the relative importance of delay as compared to loss decreases 
as the File transferred increases. In one embodiment with typical file transfers,' 

25 \% loss rate is equivalent to a 57 ms delay, which is in between the 36 ms delay 
obtained in one embodiment with short TCP connections and the 150 ms delay 
obtained in one embodiment with voice traffic. Conversely, a 100 ms delay in 
one embodiment with typical TCP connections represents a 1 .75% loss rate, in 
between the low 0.6% loss rate obtained in one embodiment with voice and the 

30 large 2.7S% loss rate obtained in one embodiment with short TCP connections. 
In order to understand the effectiveness of these approximations in such 
embodiments when both packet loss and delay come into play concurrently, we 



show in Figure 9 the approximations for a range of loss rates and one-way 
delays. As for short TCP connections, our approximation does surprisingly well 
m most of the ranges of interest for delay and loss rate., respectively. 

Unified metric for TCP traffic 

In summary, the graphs above show that, in some embodiments, the 
appropriate values of 5, for short connections and typical file transfers are 3.6 
and 5.7, respectively. In some embodiments, it is desirable that one value of 5, 
be applied to a number of TCP applications; in some embodiments. 5, can be 
set to the average of both values, 4.6. In other embodiments, it could be 
desirable to use different values for each. Using a similar methodology, those 
skilled in the art can derive embodiments with different parameters, and 
different values for these parameters. 

Unified metric for both voice and TCP data traffic 

Summarizing our results 

Table 1 below summarizes the results described in the previous two 
sections for some embodiments. Other embodiments for voice traffic can take 
into account the effect of clip length. 

Table 1 Summary of values for this embodiment 



Traffic Type 


^ 


P 


5 


Voice 


1.5 




15 


1CF 








Short TCP 
connections 


4.5 


16 


j.6 


Typical TCP 
transfers 


6.5 


3/ 


5.7 



fn some embodiments, it is desirable to use one metric for both voice 
and tcp traffic; in at least one or more of such embodiments, the metric describe 
above with 5=10 can be used. 
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Deriving user-perceived performance measures for web applications 

In some embodiments, the metric measures the quality of specific 
applications used by humans, that use TCP for transport. For some of these 
embodiments, adequate performance metrics measure the model the subjective- 

5 user-perceived quality of the application. Hence, in some embodiments, TCP 
performance can be mapped to objective application performance, which, in 
turn, can be mapped to user-perceived quality metrics. In some embodiments, 
the application of interest includes a web transaction (http). Other embodiments 
can focus on constructing a metric for other such applications, such as telnet, 

1 0 ftp, or other. In some embodiment, a mapping between the web transaction 
duration and the underlying latency of a TCP transaction can be derived and 
used. Using this model, the duration of a web transaction can be found as a 
function of the network performance metrics (e.g., delay, jitter, and loss). In 
turn, for some embodiments, the duration of a web transaction can then be 

15 mapped to an application score, which can take into account the subjective 
perception of the application quality by the user. The sequence of such 
mappings is shown in Figure 10. In this document, we provide the insights 
behind some embodiments, that involves the choice of models at each of the 
steps shown in the procedure. We start by describing the model used for TCP 

20 transactions that corresponds to some embodiments. We then go over the 

specifics of the HTTP model used (yielding the duration of a web transaction) 
in some embodiments. Finally, we explain how, for some embodiments, the 
duration of a web transaction is mapped to a user-perceived metric. 

TCP models 

25 TCP functions and mechanisms 

In the following, we describe the Transport Control Protocol (TCP) 
modeled in some embodiments. TCP is a window-based protocol which 
principal function is to provide a reliable transport between two end-points. It 
starts with a protocol handshake, followed by the transmission of packets using 
30 a sliding-window algorithm, according to which the sending window advances 
as acknowledgments are received. A simple TCP transfer is shown in Fig. 1 1 . 
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15 



20 



TCP is also provided with mechanisms that render its utilization of resources 
(that is, bandwidth and buffer) in the network adaptive, depending on the 
conditions in the network (e.g., loading, congestion, etc.). These mechanisms 
are slow star,, conges/ion avoidance, fast retransmit, and fast recovery. In the 
following, we briefly describe each of these phases. More details can be found 
both in the original congestion avoidance paper by Jacobson V. Jacobson, 
Congestion Avoidance and Control, SfGCOMM' SS.and in the invaluable book 
by Stevens W. R. Stevens, TCP/IP Illustrated, Volume J: The Protocols, 
Addison-Wesley, 1996. 

Protocol handshake 

A TCP connection is preceded by a three-way handshake: the sender first 
transmits a SYN, to which the receiver responds with a SYN/ACK. The sender 
finally replies with an ACK upon the receipt of the SYN/ACK. (See Figure 1 1 .) 



Slow start 

It should be clear that the bandwidth used by a TCP connection increases with 
the window size used by the sender. Since it may not be known originally how 
much bandwidth is available for the transfer, a start is selecting a small window 
20 size (typically equal to one segment). After that, it increases the window size by 
■ one for each acknowledgment it receives, until one of the following events 
occurs: 

1 . Either a maximum window size is reached, which is the minimum 
among the default maximum window size used by TCP, typically 64 
KB, and the receiver socket buffer size (which can be set by the 
application, and which default varies depending on the operating 
system). 

2. Or the rate of packets sent is larger than the available bandwidth, leading 
to a packet loss, in this case, the window is set back to 1 . A packet loss 
is detected using one of two ways: either the TCP retransmit timer 
expires, or three duplicate acknowledgments for the same sequence 
number of received. More details about the detection of loss can be 
found in the fast retransmit and fast recovery section. 
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In both cases, a new phase is entered, one in which a combination of slow start 
and congestion avoidance are performed, as described below. 

Congestion avoidance 

After either the window size reaches its maximum, or the loss of a packet 
occurs, the TCP transfer enters the congestion avoidance phase. In this 
embodiment, TCP is assumed to transmit at a given window size cwnd = W (the 
sender window size is called the congestion window size, cwnd), and that a loss 
event occurs. At this point, half the value of the current window W is saved into 
a variable called the slow start threshold [ssthresh - l'V/2), and cwnd is reset to 
1 . Thereafter, TCP enters the slow start phase, until cwnd reaches ssthresh, at 
which point TCP enters the congestion avoidance phase. During this phase, and 
as long as no packet loss is detected, cwnd is only increased by \lcwnd at each 
ACK received, leading to an approximately linear increase in the window size 
with time. 

The rationale behind this behavior is as follows: the original window size was 
clearly too large, since it resulted in a packet loss. Hence, a lower window size 
must be used, and this window size is most probably somewhere between JV/2 
and H\ Accordingly, slow-start allows cwnd to reach W/2 very quickly (i.e., 
exponentially); at this point, a bandwidth discovery process is initiated, in 
which TCP attempts, in a greedy manner, to obtain as much bandwidth as it can. 
In case a packet loss occurs again at any step of the process, cwnd is reset to 1 
again, and the process is restarted again (i.e., slow start followed by congestion 
avoidance). 

Fast retransmit and fast recovery 

In our discussion of congestion avoidance, we did not differentiate between 
types of packet loss. However, the TCP sender detects packet loss using one of 
two methods: 

1 . Timeout: Once TCP sends a packet, it starts a timer RTO. If the ACK for 
that packet is not received at the expiry of RTO, the packet is recorded 



as lost. RTO is computed in one the following ways for some 
embodiments: 

• During handshake, the time-out is initialized to a large value for 
example either 3 seconds in some embodiments, 5.8 m some 
embodiments. 

• Once Round-trip time measurements are obtained, then RTO is 
obtained using the following formula in some embodiments: 

RTO = A + 4D, 

Where A is a moving average of the round-trip time, while D is a 
movmg average of the mean deviation of the round trip time, i.e. the quantity W 
- A\ where M denotes the latest round-trip time measurement obtained at the 
sender side. More specifically, for each packet received, A and D are obtained 
as follows (even though A and D are typically only updated once every 500 ms). 

A <-0 -g)A + gM 
D + gQM-A\-D) 

Where g is a small gain factor (which, in some embodiments is set to a 
negative power of two). 
2. Triple duplicate ACKs: in case a packet is lost, the receiver will operate 
for each following segment received an ACK for the same sequence 
number, which corresponds to that of the lost packet. Hence, in case the 
sender receivers a multitude of ACKs for the same sequence number it 
can. assume that the corresponding packet was lost. In TCP three 
duplicate ACKs (that is, four ACKs for the same sequence number) 
s.gnal a packet loss. Only in this case do the Fast Retransmit and Fast 
Recovery algorithms kick in for some embodiments 
Hence, once a,rip,e duplicate ACK is detected, the sender enters two processes- 
it first retransmits the segment it believes is lost. This is called a Fas, 
Retraxit (since TCP doesn't wait for n time-out to occur before 
retransmitting). Then, as in congestion avoidance, ssthrcsh IS set to half the 
current window. However, TCP sets o, w / to ssihresh in this case, hence 
avoiding the slow-start phase. 
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Typical (and simplified) traces of the TCP window size in time are shown in 
Fig. 12 for two cases: in one case, transmitting at the maximum window seldom 
saturates the link. This can be the high bandwidth case; effective bandwidth can 
be translated to a scale that is consistent to that of window. That is ; we can 
actually be showing the product of effective bandwidth and round-trip time. 
Round-trip time can assumed to be constant in the figure (if not, then the 
window size increase may not have been linear with time during the congestion 
avoidance period)). In the other case, the appropriate ideal window to be used 
is significantly lower than the maximum window size chosen by the source. 
Clearly, the difference in TCP behavior between the two cases is very 
significant. In particular, the only way the window size is limited in one 
embodiment with the low bandwidth path is through packet loss. Combined 
with TCP's greediness in its attempt to capture more bandwidth from the link, 
this fact leads to a systematic pattern of packet loss. 

5 TCP performance 

The description of TCP's mechanisms above is helpful in understanding 
how a performance model for TCP is derived. Here, the model for TCP latency 
derived in N. CardwelL S. Savage, and T. Anderson, Modeling TCP Latency, 
IEEE INFOCOM 2000, April 2000, is reviewed, as we believe it to be the 
0 appropriate in capturing some embodiments owing to its accurate modeling of 
the protocol handshake and slow-start. The equation below summarizes the 
findings of N. Cardwell, S. Savage, and T. Anderson, Modeling TCP Latency, 
IEEE INFOCOM 2000, April 2000. 



10 



^of .TCPTWcdon.OT^^.jj (Handshake) 



+ RTT 



log | f f^L ] + ] + 

I J IV.. 



(Slow - start) 



I - p IV(p) 



W{p)<W t 



max 



~ - ■ ^ / 1 - p 

1 - ~ " , W{p)>w 



i - p w 

- JL +~ + o( P ,w ) 

(Transfer of remaining bits) 
+ D .,ck (Delayed Acks) 

where 

To = T s = 3 seconds 

RTO = RTT + 4 x mdev(RTT) 
Y=l+(l/b) 

E[dss] = (Cl-(l-p) d )(i. p))/p 

E K] = (E[d ss](y ., )/y) + (W]/y) 

G(p)=l + p + 2p 2 + 4p 3 + 8 p 4 + l6p 5 + 32p 6 
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associated with the handshake, the slow-start, the first loss, and the transfer of 
the remaining bits. We describe the specifics of each and then include 
comments pertaining to the overall equation. 
Protocol handshake 

5 This equation states that the average time needed to complete the protocol 
handshake is one round trip time, to which a timeout is added to each lost 
packet in the transaction. Note that at this stage, the timeout is typically as large 
as 3 seconds for some embodiements, so even a small loss probability can be 
significant. 

10 

Slow ai art 

As described above, slow start lasts from the end of the handshake up to the 
time the window size ceases to increase (whether because a packet was lost, or 
because the window has reached its maximum value). In the context of N. 

1 5 Cardwell. S. Savage, and T. Anderson. Modeling TCP Latency, IEEE 

INFOCOM 2000, April 2000, in an attempt to make the math simpler, slow start 
is given a slightly different meaning. In fact, slow start is defined as the time 
period separating the end of the handshake up to the first loss, independently of 
whether the maximum window size W mtx is reached. Given the packet loss p 

20 and the dynamics of the window size increase during slow start, both the 

expected number of segments sent during slow start E[d S5 ] and the window size 
reached at the end of the slow start process E[W SS ] are computed. Clearly, as 
shown in the equation, in case E[W ss \>\V max , then the latency calculation is 
divided into two periods, the first capturing the exponential window growth, 

25 and the second capturing the period in which the window is constant, equal to 
W*«i«.r. It is interesting to note that in case packet loss is null, then slow start 
according to this definition lasts for the entire length of the connection. 
Conversely, in case E\}V^\<W wax > t' ien the latency associated with slow start 
only includes a single component that corresponds to the exponential increase in 

30 window size. 
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First loss 

As described above, a packet loss is detected within some time period 
that translates directly into additional delay incurred by the user. This delay 
:> depends on whether the packet was detected through a time-out or a triple ' 
duplicate. In case of a time-out, then the delay is equal to RTO. which in this 
case is set to the initial 3 seconds in some embodiments. In case of a triple 
duphcate ACK, the extra delay is as low as one round-trip time. In the equation 
above, the probability that packet loss occurs because of a timeout Q(p. w) is 
1 0 computed, given the loss rate p and the instantaneous window size w The 
latency associated with the first loss can hence simplv be computed as a 
weighted sum of RTO - T a and RTT. Since a loss event can result in more than 
one packet loss, successive time-outs can potentially occur; the factor G(p)/ 0 - 
P) takes into account this fact by appropriately scaling the first time-out value 
15 T 0 . 



Transfer of remaining bits 

The following component of the TCP latency equation corresponds to 
the latency incurred by the transfer of the bits that remain after this first loss 
Although complex, we can see that the formula basicaliy models the effect of 
loss (whether detected through timeout or the receipt of triple duplicate ACKs) 
and that of congestion avoidance (as witnessed by the computation of Wip) 
wh,ch represents the average window size given the loss rate and the dynamics 

of congestion avoidance). Hence, this cnm W nf*. 

rui.vj,, Wi u „ equauun is oasicaiiy the 

«t,o of the remaining bits to be transferred {d - E[J ss]) to the average rate of 
transfer, m turn equal to one window per time period set to the sum 0 f*7Tand 
whatever additional delay is caused by congestion avoidance. What this 
component does NOT model, however, is slow start after retransmission of 
timeouts. This means that the equation above assumes that TCP is more 
aggressive than it actually is, which results to an under-estimation of the actual 
latency involved in the transaction. However, N. Cardwell, S. Savage and T 
Anderson, Modeling TCP Latency, IEEE INFOCOM 2000, April 2000, explains 
that the effect of this negligence should be small. 
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Delayed A CKs 

The final component in the equation above models (in a trivial way) the 
extra delay caused by delayed ACKs for some embodiments. In fact, 100 ms 
5 was measured as a good average delay increase caused by delayed ACKs in 

some embodiments. ACKs are not always sent back to the sender immediately 
after a segment is received at the destination. In fact, the functionality of TCP is 
as follows for some embodiments V. Paxson, Automated Packet Trace Analysis 
of TCP implementations, SIGCOMM* 97: two packets have to be replied to 

10 immediately. However, the receipt of an individual packet is only replied to at 
the end of a clocking timer, typically set to 200 ms in some embodiments. 
Hence, 100 ms seems like a good average value to be added to take into account 
the effect of duplicate ACKs for some embodiments, 
(e.g., see M. Allman, A Web Server's View of the Transport Layer, ACM 

1 5 Computer Communication Review, Vol. 30, No. 5, October 2000. 

Comments about the model 

The model in iM. Mathis, J. Semke, and J. Mahdavi, The Macroscopic 
Behavior of the TCP Congestion Avoidance Algorithm, ACM Computer 
20 Communication Review, July 1997, is indeed simpler, but also captures much 
less of the richness inherent to the behavior of TCP. As shown above, the 
model captures, in an intuitive way, a lot about TCP's behavior. In this section, 
we go over straightforward and useful comments on the model. 

1 . Latency does not depend on the link bandwidth. In fact, this model does 
25 not directly model the link bandwidth, but considers it through its effect 

on delay, jitter and loss. For example, if bandwidth is too low, then the 
loss rate increases (as is clear from Fig. 1 1). At the same time, round-trip 
time increases through increased queue sizes at the bottleneck link TCP 
time-outs. Note that for transactions that involve small file sizes, the 
30 latency is mostly affected by the first two components, i.e. the protocol 

handshake and slow start. Hence, the throughput of the transaction is 
probably unaffected by the bandwidth of the bottleneck link (that is, 
other than the effect of the later on loss rale or RTT). 
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2. Latency is in all cases a linear function of the round-trip time. It is true 
that this is an artifact of the above assumption (since the only non- 
linearity would have come from the effect of bandwidth on the latency 
of the TCP connection). However, since the effect of bandwidth alone is 
very small, the linearity of the TCP latency with the round-trip time 
holds in most cases in real systems. 

3. Contrary to round-trip time, latency is a non-linear function of loss 
probability. In fact, for loss rates above 5%, the increase in latency with 
the loss rate accelerates significantly. 

In some embodiments, the model described above can be used to model 
TCP behavior in one or more of the steps. 

HTTP 

In some embodiments, the metric measures web transactions. In this 
section, we describe http mechanisms used in some embodiments. In some 
embodiments, such transactions use the hypertext transfer protocol (HTTP), 
which, in some embodiments defines how TCP is used for the transfer of the ' 
different components of a web page (that is, for example, the html page, the 
different images, etc). In some embodiments, the first version of HTTP 
HTTP/1 .0, described in RFC 1 945 T. Berners-Lee, R. Fie.ding, and H. Frvstyk 
Hypertext Transfer Protocol - HTTP/1. 0, IETF Request for Comment RFC 
1945, May 1996, encourages the following practices: 

1. Different components of a given web page (e.g., the html text, and each 
of the different objects) are transferred using distinct TCP connections. 

2. In order to increase the speed of the transaction, web browsers are 
allowed to open more than one TCP connection at a time. (The tvpical 
number of parallel concurrent TCP connections is 4 in some 
embodiments.) 

In some embodiments, this approach is adequate for relatively smaller 
web pages, which include a few objects, when traffic on paths on the Internet is 
relatively limited. But as the Internet has become more and more ubiquitous the 
d.sadvantages of this approach can become, in some embodiments more and 
more apparent: 
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1 . On one hand, in some embodiments, an independent TCP connection 
goes through both the handshake and slow-start mechanisms. Since, in 
some embodiments, individual objects are relatively small, so TCP 
connections spend most of their time in these stages, rather than in the 

5 congestion avoidance stage. In some embodiments, this can be 

inefficient. 

2. On the other hand, it has been shown that in some embodiments, 
opening a multitude of TCP connections simultaneously increases the 
greediness and aggressiveness of the web browser's behavior, H. F. 

10 "Nielsen, J. Gettys, A. Baird-Smith, E. Prud'ommeaux, H.W. Lie ; and C. 

Lilley, Network Performance Effects of HTTP/1 J, CSS J, and PNG, 
SIGCOMM' 97, H. Balakrishnan, V. Padmanabhan, S. Seshan, M. 
Steem, and R. Katz, TCP Behavior of a Busy Internet Server: Analysis 
and Jmprovments, IEEE Infocom 1998, S. Floyd and K. Fall, Promoting 

15 the Use of End-io-End Congestion Control in the Internet, IEEE/ACM 

Transactions on Networking, 7(6), August 1999. To understand this, we 
provide the following simple example (from S. Floyd and K. Fall, 
Promoting the Use ofEnd-to-End Congestion Control in the Internet, 
IEEE/ACM Transactions on Networking, 7(6), August 1999). Say a data 

20 transfer is divided among N parallel, concurrent TCP transactions. 

Assume a packet is lost in one of the connections. If all the data were to 
be transferred using one single TCP connection, the lost packet would 
lead to the halving of the window size, i.e. to the halving of the 
connection throughput. Instead, when N concurrent TCP connections are 

25 used, the lost packet will only halve the window size of one of the N ■ 

connections, leading to a reduction of the aggregate throughput by a 
mere l/2N\ That is, the congestion algorithm that TCP is intended to 
perform is skewed towards a much larger greediness and aggressiveness, 
leading to an increase in congestion that can in turn bear a significant 

30 degradation in the performance of all streams involved. 

In some embodiments, the first problem can be solved through the setting of the 
Keep-Alive option T. Berners-Lee, R. Fielding, and H. Frystyk, Hypertext 
Transfer Protocol - HTTP/ 7.0, IETF Request for Comment RFC 1945, May 
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max(Lu Li, La). In some embodiments., finding the exact maximum among 
various TCP connections is a complex process, as it implies knowledge of the 
latency distribution, whereas the equation described above only derives the 
average latency. In some embodiments, L„ Ifl . v can be approximated, based on the 
5 following assumption: assuming that only one connection among the four 

connections incurs a packet loss, then L ma x may simply approximated to be the 
duration of that particular transaction. Assuming that the loss probability on the 
link is equal to p 9 each connection then may be assumed to incur a loss rate p, so 
that the probability of no loss by either connection can be approximated to P„i = 

10 1- P = (\-p) A - That is. in some embodiments, the probability that either 

connection experiences a loss becomes equal to P = 1-(1 -p) A which, in some 
embodiments, can be approximated to 4p for small p. That is, in some 
embodiments, an approximate value of L max can be obtained using the equation 
for average latency, and in some embodiments, the loss rate can be set to 4p. 

1 5 Other embodiments may use different approaches to yield various models. 

Description of models used 

In the following, we describe different embodiments of models that can be 
derived for both HTTP/1.0 and HTTP/1.1 downloads. In some embodiments, it 
can be assumed that what is downloaded is one or more plurality of objects 
20 from what is assumed to be a "typical" web site. Other embodiments of this 

invention can derive these results for other HTTP embodiments , and other web 
sites. 

In some embodiments, a typical web transaction can be modeled using the 
following: 

25 1 . A one-segment request for the web page to be transfer. 

2. ... followed by a transfer of a file of size / In some 

embodiments, the actual latency formula representing the 
transfer depends on whether HTTP/1 .0 or HTTP/1 . 1 are used. 
In some embodiments, we can assume that the path is characterized by a loss 
30 rate p y a round-trip time RTT, and a round-trip lime jitter J. 

In the case of HTTP/ 1 .0, some embodiments assume that the Keep-Alive option 
is set. Also, some embodiments assume that half the file constitutes the actual 



html ASCII text, whereas the other half includes the different images in the file. 

While, in some embodiments., the ASCII is downloaded using a single TCP 
connection, the information pertaining to the images is downloadedjn parallel 
over 4 independent TCP connections. Denoting the latency of a TCP transfer of 
a file of sizeX, over a path characterized by a loss rate p, a round-trip time RTT 
and a round-trip time jitter J by L(f s ,p, RTT, J), then the total duration of the 
web transaction can, in some embodiments, be approximated to 

AnTP„.o = ZOSOOBytes,./,, RTT, J) + L(f s /2,P, RTT, J) + L(f s /8, 4p, RTT, J) 

Those skilled in the art can start with a different set of assumptions, follow a 
similar methodology, and derive various other models that derive the total 
duration of an HTTP/I .0 web transaction in function of the various dynamic 
parameters of the path. In the case of HTTP/I. 1, some embodiments assume the 
use of persistent, pipelined transactions. Since, in some embodiments, a single 
TCP transaction is used for the download of the entire file, the total duration of 
the web transaction can, in some embodiments, be approximated to 



Am-p/u = L(l 500Bytes, p, RTT, J) + L(f s , p, RTT, J). 

Those skilled in the art can start with a different set of assumptions, follow a 
similar methodology, and derive various other models that derive the total 
duration of an HTTP/1 . 1 web transaction in function of the various dvnamic 
parameters of the path. 



Some web transaction duration results 

In this section, we show a set of web transaction duration (denoted D .) 
results for some embodiments of this invention, as a function of both round-trip 
fme. loss rate and jitter. (See Figs. 15-17.) ,„ the context of these examples the 
firaphs are shown as a function of one-way delay and jitter, simply obtained'bv 
halvmg the round-trip time and round-trip time jitter, respectively. Those sk i, led 
^ deriVC eXamp ' eS < Where *. Parameters in the graphs constitute 
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round-trip parameters. Since, in some embodiments, TCP latency as shown in 
the previous section may be linear with round-trip time, the linearity of web 
transaction duration with one-way delay is expected as a result. (See Figure 15.) 
Also, it can be seen from Figure 16 that for some embodiments, the curve 
showing the dependence of D w on p may be convex, that is. its slope increases 
with increasing values of p. Since, in some embodiments, HTTP/1.0 depends on 
4p T the increase of D w with p may become increasingly significant as p exceeds 
5%. Conversely, in some embodiments, loss rate does not seem to affect much 
the duration of an HTTP/1.1 web transaction. Finally, Figure 17 reveals that in 
some embodiments, the dependence of D w on jitter is very small for both 
HTTP/] .0 and HTTP/1 .1, irrespectively of the file size; therefore, in some 
embodiments, the jitter parameter may be ignored. 

metric versus web transaction duration 

In this section, some embodiments of the remaining task of mapping 
transaction duration to a measure of quality that captures the user's perception 
are described. The embodiments described rely in part on relevant papers on the 
subject, A. Bouch, N. Bhatti, and A. J. Kuchinsky, Quality is in the eye of the 
beholder: Meeting users' requirements for Internet Qualify of Service. To be 
presented at CHl'2000. The Hague, The Netherlands, April 1-6, 2000, pp. 297- 
304. and A. Bouch, N. Bhatti, and A. J. Kuchinsky, Integrating User-Perceived 
Quality into Web Server Design, HP Labs Technical Report HPL-2000-3 
20000121 s which present experiments designed to estimate users' perception of 
quality in the context of web transactions. (The paper stresses, in particular on 
e-commerce.) In the papers, the authors have created a web site with latency 
programmability. (That is, they control the latency of the transfer of individual 
pages.) Users are asked to go through the different pages of this site, and rate 
the latency obtained as low, average of high. The final result is the graph shown 
in Fig. 1 8, that shows the percentage of users responding "low", "average" and 
"high" versus the actual duration of the web transaction. 

The results shown in Fig. 18 can be translated to some subjective 
measure of user-perceived quality, which we denote by "synthetic Mean 
Opinion Score" (MOS), from which the metric will be derived. In this respect. 



in some embodiments, a graph presented in N. Kitawaki and K. Itoh, Pure 
Delay Effects on Speech Quality in Telecommunications, IEEE Journal of 
Selected Areas in Communications, Vol. 9, No.4, May 1 99 1 , can be used that 
maps MOS values to an "impermissible rate", that is, to the percentage of users 
that think the quality is unacceptable. This graph is shown in Figure 19 For 
some embodiments, A MOS versus web transaction duration function can be 
obtamed by assuming that the "high latency" rate in Figure 18 can be 
interpreted as an "impermissible rate". I„ some embodiments, this assumption is 
reasonable, since, as far as the content provider is concerned. high latency is 
unacceptable and should never be experienced by the user. This may be thought 
to be true, especially for high value transactions., such as trading, or even 
shopping. Those skilled in the art can follow the methodology described in this 
document to use other studies of user-perceived quality and derive 
corresponding metrics. 

The resulting MOS versus latency curve can be obtained for some 
embodiments. Interestingly, the curve shows that for these embodiments the 
MOS does not degrade smoothly as transaction duration increases. In fact, in 
these embodiments a sharp decrease from a MOS of 5 to a MOS of 4 occurs 
when the transaction duration exceeds the two seconds mark. In one 
embodiment, the MOS ratings from 1 to 5 bear the following interpretations- 5 
for Excellent, 4 for Good, 3 for Fair, 2 for Poor and 1 for Bad Other 
embodiments use different values. Similarly, in some embodiments, a similar 
behavior occurs around the 8 seconds mark. 

Applying the MOS versus latency curve to the latency results: some MOS 
results. 

In some embodiments, deriving the metric from the MOS value can 
compnse a step where the 1 to 5 MOS scale is normalized to a scale ranging 
from 0 to 1 . ,n Figs. 20-22, we present the resulting metric scores corresponding 
to the web transaction duration results shown in Figs. 15-17, for some 
embodiments of this invention. 
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In some embodiments, such MOS functions can be used to derive 
metrics both for HTTP/1 .0 and HTTP/1 .1 . For some embodiments, the 
following conclusions can be drawn, as observed from the metric graphs above: 

• For some embodiments, the effect of jitter is negligible. 

• In some embodiments, the effect of loss rate is not as large as one would 
expect. In fact, in some embodiments, TCP latency increases 
considerably as the loss rate increases from 2% to 10%. However, in 
some embodiments, what is important is the user perception. For 
example, even though an increase of latency from 100ms to 2 seconds 
represents a 20-fold increase, this increase can, in some embodiments 
have a negligible effect of quality. 

• In some embodiments. The results also show that the file size affects the 
metric significantly, which, in these embodiments, is expected. In some 
embodiments, if the focus is on web transfers of the transactional type, 
then file sizes of up to 10 KBytes may be considered. 

• In some embodiments, the version of HTTP used has a surprisingly high 
effect on the results. In some embodiments, this is especially the case 
when loss rate is high (that is, larger than 5%). In some embodiments, 
the reason for this behavior is, again, the fact that the effective loss rate 
incurred in one embodiment with HTTP/1.0 is four times that incurred 
with HTTP/ 1.1. 

An additive embodiment of HTTP/1,0 and HTTP/1 A metrics 

In some embodiments of this invention, it is desirable to derive metrics 
for HTTP/1 .0 and HTTP/1 . ] that are additive (in the sense described above). In 
this section, we describe such embodiments. 

The metrics derived here match for some embodiments, those shown 
graphically in the previous section for a wide range of the elementary dynamic 
parameters of delay, jitter, and loss. Those skilled in the art can use the same 
methodology to derive similar metrics with different parameters. 
Let a, b 5 c, and d be parameters that can be tailored to the particular application. 
The embodiments described here correspond to HTTP/1.0 and HTTP/1 .1 : 
respectively: 



For HTTP/1 .0, a = 1 . 1 8, b =0. 1 3, c =0. 1 5.. and d =0.25; 
Fpr HTTP/ 1 . 1 , a = 1 .3 0, b =0.3 1 , c =0.4 1 , and d = 1 .00. 

Let 

.v = -(20.0/9.0)*(6-c); 
>' = (1.0/9.0)*(10.0*c-6); 

Letp max = x x delay + y (where delay is the delay in seconds). 
Let loss be a measure of loss on the path. 

^ doss </w /</), then we/rfc/,*,, = 1 .0 - Iog(l .0- loss/d)/log(] .Q. Pmax /d) 
if (/o.w > p,,,^ /d), metricLoss = 0.0; 

metric Del ay =1.0- {delay)/ a 

Let we/r /c denote the metric derived for the application; then for this 
embodiment, the value of metric is obtained as follows: 

If {metricLoss + «,<wZfe/^ > 1 .0), metric = me/,-/cZo„ + metricDelay -1.0 
?0 If {metricLoss + mosDelay < 1 .0), metric = 0.0. 

The metric value shown in these embodiments is clearly additive, in the sense 
described above. Those skilled in the art can derive additive metrics for other 

applications using different Darametefs ™ hw_ q „. „ ., 

. ... - ".ww, >_,u vaiucs or rnese parameters 
^ following the methodology described above. 

In some embodiments, the metric can, in addition to being performance related 
(as described above), can include non-performance related characteristics- in 
some embodiments, the non-performance related can include pre-specif.ed route 
preferences. 
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Apparatus for Characterizing the Quality of a Network Path 

In this final section, we describe some embodiments of network devices 
that characterize the quality of network paths and use these in the process of 
routing. Effectively, such a network device configured., such that if the network 
5 device is connected to at least a network path including a first segment and a 
second segment, the network device performs: 
1. accessing a first metric and a second metric, 

- wherein the first metric and the second metric are at least in part 
quality characterizations of a same plurality of one or more 

1 o network applications, wherein the quality characterization 

characterizes a quality of the same plurality of one or more 
network applications running at one or more segment end-points 

- wherein the first metric and the second metric are at least partly a 
function of a same plurality of one or more elementary network 

1 5 parameters, wherein the plurality of one or more network 

parameters include one or more of delay, jitter, loss, currently, 
available bandwidth, and intrinsic bandwidth 

- wherein the first metric is at least partly the function of the same 
plurality of elementary network parameters of the first segment, 

20 wherein the one or more segment end points include one or more 

end-points of the first segment 

- wherein the second metric is at least partly the function of the 
same pluralit of elementary network parameters of the second 
segment, wherein the one or more segment end points include 

25 one or more end-points of the second segment 

2. adding the first metric and the second metric to generate a third metric, 
wherein 

- the third metric is at least partly the function of the same 
plurality of one or more elementary network parameters of the 

30 network path, wherein the one or more segment end points 

include one or more end-points of the network path 

- the third metric is a quality characterization of the same plurality 
of one or more applications 
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In some embodiments, the network device further performs: 
prior to accessing the first or the second metric, generating at least one of the 
first metric and the second metric 

In some embodiments, the network device further performs: 

prior to accessing the first or the second metric, receiving at least one of the f irst 

metric and the second metric 

In some embodiments, the network device deals with a plurality of one 
or.more network parameters that is dynamic. J„ other embodiments, the network 
dev.ce is such that at least one of the plurality of one or more network 
parameters is static. 

As described in the method sections, the plurality of one or more 
network applications include at least one of UDP and TCP applications UDP 
applications include voice, video, whereas video applications include video 
conferencing. In some embodiments, TCP applications include HTTP whereas 
HTTP applications include HTTP/3 .0 and HTTP/1 . 1 . In some embodiments. 
TCP applications include ftp and telnet. 

In some embodiments, the network device is such that the plurality of one or 
more network parameters include delay, jitter, loss, currently available 
bandwidth and intrinsic bandwidth. 

In some embodiments, the metric can, in addition to being performance 
related (as described above), can include non-performance related 
characteristics; in some embodiments/the non-performance related can include 
pre-specified route preferences. 

In some embodiments, the network device further comprises: 

- a plurality of one or more inputs adapted to be coupled to the 
network path, 

- a plurality of one or more outputs coupled to the plurality of one 
or more inputs, wherein, responsive to a plurality of one 0r more 
packets arriving to the network device through the plurality of 
one or more inputs, the network device selects at least one output 
from the plurality of one or more outputs, wherein the at least 
one output is determined at least partly using at least one of the 
first metric, second metric, and third metric. 
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Measurement Packets 

A measurement packet is a packet sent by a sender over an internetwork 
that includes information necessary for the receiver of the packet lo compute 
measurements of the performance characteristics of the path the packet has 

5 traversed over that internetwork. The information includes information for a 
receiver of the measurement packet to compute measurements of performance 
characteristics of at least a portion of the path of the measurement packet; and 
data including one or more of measurement statistics, a generic communication 
channel, network information, and control data directing a receiver of the 

10 measurement packet to change one or more configuration parameters of the 
receiver. 

In some embodiments of the invention, the information included in the 
measurement packet to compute measurements includes at least one of a 
timestamp of a sending time of the packet and a number to identify the packet 
1 5 by itself and/ to identify the relative position of the measurement packet in a 
sequence of measurement packets, 

In some embodiments of the invention, the measurement packet is 
implemented using the following data structure: 

20 struct MeasurementHeader { 

/** 

* A generation number. This value represents when the 

* sender began sending. This value is a standard Unix 
25 * timestamp that seconds since Jan 1, 1970 UTC . 

+ * / 

uint32_t mGeneration; 
/** 

30 * A sequence number for the packet. This increments each 

* time a packet is sent and rolls over when 16 bits is 

* exceeded. 

w 

uintl6_t mSequence; 



/** 

* The IP address the packet is sen t to 
**/ 

uint3 2_t mDstAddr; 
/** 

* The send timestamp for this packet. 

+ * I 

uint64_t mSendTime; 



The mGeneration field is used ,o detec, when a sending process has 
started a new session. This field is used by the receiver ,o determine tha, a 
^continuity in the stream's sequence numbers is the result of a sender restart 
...her than due to iarge network latencies, duphcate packets or dropped packed 
The sequence number ^ne. eiela is incremented by one each 
^ iS Mt ™ S a ~ a"ows the receiver to deduce lost and 
duphcate packets by identifying missing and dup.icate sequence numbets 

The mSendTime field contains the time a, which the packet was sen, 
represented as microseconds since January 1 , 1 970 UTC Tins field is ' 
compared to the time the packet arrived a, ,„e rece.ver to determine the delav 
between (lie sender and the receiver. 

In some embodiments of the invention, a piurality of one or more 

packets are sen, over a path continuously. ,„ some embodiments of the 

■nventton, the continuous stream of packet is denoted as a measurement stream 

bach measurement stream is uniauelvid~.ifi.Hi,..., 

IPiAw -r, —"juic source and destination 

HH ' ma " ,,ainS ° M S ° Ck " d =- ^>»'- source ,P 

a ^ess ,t sends from and writes the destination ,P address in.o the mDstAddr 
field. On the receiver side, ,he source IP address is returned bv the rec,n 
system ca„ and the destina.ion address is retrieved from the measuremen 
packet. 
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Data fnclnded in the Measurement Packets 

In measurement packets that contain sufficient space, data will be 
included, including one or more of measurement statistics, a generic 
communication channel, network information, and control data directing a 
5 receiver of the measurement packet to change one or more configuration 
parameters of the receiver. 

Some embodiments of the invention will add a single type of data to 
each packet. Some embodiments of the invention will use a complex data, 
including subpackets. 

10 Some embodiments of the invention use subpackets that include a single 

byte subpacket type identifier, followed by a 2-byte length field (including the 
length of the type and length fields) and finally including the data that is to be 
sent. One embodiment will store all values in network byte order. Other byte 
orders will be apparent to those skilled in the art. The following data structure 

15 definition describes some embodiments. 

class SubPacket { 
/* 

* The type identifier for this subpacket. 
20 */ 

uint8_t mType ; 

/* 

* The length of this subpacket, in network byte order . 
.25 */ 

uint I6_t mLength; 

}; 

One embodiment of this invention will include data describing a 
30 momentary snapshot of the measurement statistics for a given path between a 
sender and a receiver. 

In some embodiments of this invention, this data will include one or 
more of the following information: the source and destination IP addresses that 
define the path, a measurement packet size for which the statistics have been 
35 calculated as well as computed measurement statistics that are at least partly 



10 



15 



20 



2d 



30 



response ,o deiay; computed measurement statistics that are a, ieas, partlv 
responsive to Jitter and computed measurement statistics that are a, teas, partlv 

responsive to packet loss. " 

In one embodiment of this invention, these statistics will be in tmits of 
™_ds expressed as 64-bi, floating-point ,ua„,i,ies and transmitted in a 
standard network byte order. 

In one embodiment of this invention, the following data structure wil, 
store the computed statistics: 



class TunnelStatsSubPacket : public 

lc snapshot was taken (in 

e 1970) . 
uint64_t mTimestamp; 



lc SubPacket { 

* The time that this statistic 

* ra i^oseconds since 1970) 
**/ 



* to. 
uint32_t mSrcAddr,- 
/** 

* The destination IP address of ^ - 
statistics S tUnnel these 

. * apply to. 

*v 

uint32___t mDstAddr; 
/ * * 



* all packet size 

■k-k J 

uintl6_t mPktSize; 
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/** 

* The average delay in microseconds . 
**/ 

5 double mDelay; 

/** 

* The average jitter in microseconds . 
**/ 

10 double mJitter; 

/** 

* The percentage of packets that have been lost, in the 
range 

15 * o to i . 

**/ 

double mLoss; 



20 Some embodiments of this invention include the time at which the 

statistics were computed such that those statistics are sent over multiple paths 
for improved reliability and to take advantage of one path having less delay than 
another. One embodiment at the receiving end is able to compare the 
computation times of received statistics to place them in their original temporal 

25 order, regardless of their relative arrival times over the paths. 

Some embodiments of this invention will send computed statistics 
specific to the paths that are part of the plurality of one or more paths that are 
between the specific sender and receiver. Other embodiments will send 
additional computed statistics for paths that are not one of the plurality of one or 

30 more paths that are between the specific sender and receiver. 

Some embodiments of this invention will include network information 
concerning network topology including but not limited to information retrieved 
from routers such as in-bound or out-bound link utilization, inbound or out- 
bound link bandwidth and/or CPU utilization. Other network information 

35 determined from routers and other network devices will be apparent to someone 
skilled in the art. 



Some embodiments of this invention will also include control data 
directing a receiver of the measurement packet to change one or more 
configuration parameters of the receiver. 

In some embodiments of the invention, thecontrol data will instruct a 
receiver to alter its configuration, including but not limited to zero or more of 
the following examples: instructing a receiver to initiate sending a plurality of 
one or more measurement packets., change one or more of the measurement 
packet sizes, inter-measurement packet transmission times and mix of packet 
sizes, and stop sending one or more of the plurality of measurement packets. 

In some embodiments of the invention, this control information will 
include notification of measurement devices that have joined or left the 
network. 

In many embodiments of the invention, the measurement packets will be 
encrypted by the sender and decrypted by the receiver. Some of these 
embodiments will use IPSec. 

In some embodiments of the invention, the encryption and decryption 
will be done by an external device using IPSec. 

Other encryption and decryption options will be apparent to one skilled 
in the art. 

In some embodiments of the invention, the measurement packers will be 
digitally signed. 

In some embodiments of the invention, a generic communication 
channel will be used by a sender and a receiver to communicate data between 
them. 

Performance Characteristics of a Path 

Measurements are used to compute performance characteristics of the 
paths traversed by the measurement packets. The measurements can either be 
computed from the measurement packets themselves, or extracted from the 
arbitrary data carried by the measurement packets. The measurements of 
performance characteristics include at least one or more of one-way 
measurements and round-trip measurements. The performance characteristics 
mclude at least one or more reachability, delay, jitter, loss, available bandwidth 
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' and total bandwidth. Other performance characteristics will be apparent to those 

skilled in the art. 

In some embodiments of the invention, delay measurements are 
computed as the interval of time from the moment the measurement packet is 
5 sent by the sender to the moment of time the measurement packet is received by 
the receiver. The sending time is carried by the packet, and it is measured by the 
clock the sender refers to. The receiving time is measured by a clock that the 
receiver refers to, which may or may not be synchronized with the sender's 
clock. 

[ 0 In some embodiments of the invention, the clock of the sender and the 

clock of the receiver are synchronized. A plurality of one or more precise clock 
inputs such as GPS, NTP, IR1G and NIST will be used. Some embodiments of 
this invention will use the same clock as an input to more than one of the 
plurality of one or more senders and receivers. Iri ; some embodiments of the 

] 5 invention, the clock of the sender and the clock of the receiver are the same. 

In some embodiments of the invention, the clock of the sender and the 
clock of the receiver are not synchronized, and mechanisms based on the 
measurement data are used to correct the clock skew and clock drift, the 
mechanisms including using minimum delay across multiple measurement 

20 samples, and using a mechanism to track the minimum delay over time. 

Some embodiments of the invention will use the minimum round-trip 
delay between the devices to place a lower bound on clock skew. 

Some embodiments of the invention will use the lower bound of 
multiple paths between the sender and receiver to further reduce the lower 

25 bound. 

Some embodiments of the invention will correct for clock drift by 
tracking the relative clock skew between the sender and receiver over time and 
adjusting for the slope of the drift. 

In some embodiments of the invention, jitter measurements, also known 
30 as inier-measurement packet delay variations, are computed as the difference in 
delay on consecutive, successfully received packets. 



In some embodiments of the invention, jitter can ako h, 
«* Terence between the instantaneous ^ ^ " 
delnv „f nil ' packet, and (he average 

delay of all the measurement packets previously received. 

In some embodiments of the invention t„ 
* by assigning a timeout value to e "ments are computed 

Instan, of time after w Z IT ^ "« 

Packet has no, arrived ^ ^7™" ^ ^ "* ' 

mem packet In some embodiments of the invention ,h„ • . 
transmission time can be estimated if , he reC e iv e r ^ ' """" 

Pa«ern of transmission of measurement ZZZ T 

i-ention, the transmission de,av of pacta cfb " "** 

J"*r performance characteristics ^ °" =» d 

Performance characterist.es of a path could be the measurement 
themselves, or statistics on those measurements ,„ the s, n ,,s, 
^rithm ,s used to updates the statistics ^J^^.' 

obtatned wi , , e arriva, of eve, new p^CT ^ 

an. In some embodiments of the ■ PPare "' * *"* *" ed in «" 

based on the Rnhh- w moving average, an averace 

^ Z e I :; e M r° 3 " • bJ,- 

in thel rf '° b = — <° those 

In some embodiments of the invention ,Hp „ • 

«P°ne„tia„ y movi ng a vera S e computed u "m^" " ^ 
Robbins-Moro stocha«,V , ■ • R <*b,ns-Moro estimator. The 

ec,„a,ion: PPr0X '" ,a " 0n ** « of the 
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where E is the expectation, f(t) a function and x the estimator. The general form 
of the solution is: 

x(0 = x(t-l) + alpha * [f(Q - x(l -!)] = (!- alpha) * x(t-l) + a//;/?a */(?; 
or. with alpha = (J - u) t 

x=p *x+(l-p) *f 

p is the weight of the estimator, and determines the amount contributed to the 
average by the function.. In some embodiments of the invention, // is constant. 
In some embodiments of the invention, // is a dynamic value, whose value 
depends on the last value of the function f according to the formula: 

M = e *(-f/K) 

where K is a constant that also determines the importance of the last value off 
with respect to the current value of the estimator x. 

In some embodiments of the invention, average delay can be computed using an 
exponentially moving average as follows, 

d = p * d + (1 - p) * m 

where d is the exponentially moving average of delay, m is the last delay 
sample, and p is the weight of the moving average. 

In some embodiments of the invention, average jitter can be computed using an 
exponentially moving average as follows, 

v = / , * v + (1 - p) * \d-m\ 

where v is the exponentially moving average of jitier. \d - m\ is the last sample 
of jitter, and // is the weight of the average. 
0 



•17 



In some embodiments of th 
exponentially moving average as follows, 



e invention, average jitter can be computed using an 



/ = d + A/ - * v 
Where d is the average delay, v is the 



average jitter and M is a constant. 
In some embodiments of the invention average |«™ u 

expo„ M(iall) . moi ,„ g awrage as fo „: v ;; as s CM be compu,ed — - 

where p-hat is the moving average of the loss d - /n f > • 

i ■ ' P ~ i° " packet is received ! ; c 

;:nr 1 ,s d ,os,i ' and ' ,s ,he - « — — >• 

notion o f n r e mb ° dimemS ° f im ' en,i °"' " is d — « *« 

notion of forgiveness against a single pacfcei loss Th, f„ • 

—a, of «i„,e between the ,i me t e pa cke s ^ " 

me packet Joss occurs and the tiW th* 

is, .he forgiveness period wilI end ^ n c ' ^ " h, °" n - T "" 

-i ved after „,e ,oss. ,,en tJs ^JZT ^ ^ ^ 
rate. e been at a certain 



packets is needed before // mn k» ^ ♦ 

can be determined, and this value is known as 



as the 
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forgiveness threshold. In some embodiments of the invention, the forgiveness 
threshold is chosen arbitrarily. In some embodiments of the invention, the 
forgiveness threshold takes the value: 

K ( I ' M) 

5 This value is half of the value of the estimator after the singe loss occurs, and 
thus we call it the half-life threshold. Similarly., we also call the forgiveness 
period under this threshold the half life period. The advantage of using a 
forgiveness threshold greater than zero is that issues related to host-dependent 
floating-point representations reaching that value are avoided. 

10 In some embodiments of the invention, p is computed by comparing the value 
of the estimator after n consecutive packet arrivals since the loss with the half- 
life threshold: 

p-hat = (1- f() * fi*n< % (J - u) 

Given that n is known because is determined by the value of the half life period 
] 5 and the transmission rate, u is computed as: 

p = exp(0n V 2 )/n) 

In some embodiments of the invention, two thresholds are defined, an upper 
threshold and a lower threshold. When the value of p-hat exceeds the upper 
threshold, the loss is not forgiven until enough measurement packets are 
20 received consecutively so that the value of p-hat gets below the lower threshold. 

Other mechanisms to compute p will be apparent to for those skilled in the art. 

Path Description 

In some embodiments of the invention, the path traversed by the 
25 measurement packets from the sender to the receiver is such that the path is at 
least partly implemented with at least one of a GRE tunnel, an IPSEC tunnel 
and IPonIP tunnel. Other path implementations using tunnel will be apparent for 
those skilled in the art. 
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In some embodiments of the invention, the path traversed by the 
measurement packets from the sender to the receiver is implemented with a 
virtual circuit, including a frame relay PVC, an ATM PVC or MPLS. Other path 
implementations using virtual circuits will be apparent for those skilled in the 
art. 

Other path implementations will be apparent to those skilled in the art. 

Internetwork Description 

In some embodiments of the invention, the internetwork is implemented 
by a plurality of one or more subnetworks, including a plurality of one or more 
VPNs, a plurality of one or more BGP autonomous systems, a plurality of one 
or more local area networks, a plurality of one or metropolitan area networks, 
and a plurality of one or morewide area networks. 

In some embodiments of the invention, the internetwork is implemented 

by an overlay network. 

Other internetwork implementations will be apparent to those skilled in 



the art. 



Packet Sizes and Transmission Times 

In some embodiments of the invention, the measurement packets are of 
varying sizes, including 64, 256, 512, 1024, 1500 bytes. 

In some embodiments of the invention, the size of the measurement 
packets is specified with an external API. 

In some embodiments of the invention, the measurement packets are of a 

fixed size. 

In some embodiments of the invention, the measurement packet sizes 
and times between measurement packets simulate the traffic pattern of a 

plurality of one or more applications 

In some embodiments of the invention, traffic patterns correspond to voice 
applications, where the packets re of small size, e.g., 30 bytes, and the ,nter- 
transm.ssion time between consecutive packets is constant, e.*., 10 ms These 
examples do not limit the possible size values and inter-transmission time 
values. 
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In some embodiments of the invention, traffic patterns correspond to 
video applications, where the packets size is the largest permitted to be 
transmitted by an internetwork without being fragmented, and the inter- 
transmission time between consecutive packets varies depending on the spatial 
5 and temporal complexity of the video content being transmitted, the 
compression scheme, the encoding control scheme. 

In some embodiments of the invention, traffic patterns correspond to the 
plurality of applications observed in an internetwork, including at least one or 
more of HTTP transactions, FTP downloads, IRC communications, NMTP 
10 exchanges, streaming video sessions, VoIP sessions., videoconferencing sessions 
and e-commerce transactions. Other types of applications will be apparent to 
those skilled in the art. 

In some embodiments of the invention, the inter-measurement packet 
transmission times are of varying length. 
1 5 In some embodiments of the invention, the inter-measurement packet 

transmission times are of fixed length. 

In some embodiments of the invention, the inter-measurement packet 
transmission times specified with an external API. 

In some embodiments of the invention, the length of the inter- 
20 measurement packet transmission times is randomized according to a 

distribution. In some embodiments of the invention, this distribution is based at 
least in part on a uniform distribution. In some embodiments of the invention, 
this distribution is based at least in part on an exponential distribution. In some 
embodiments of the invention, this distribution is based at least in part on a 
25 geometric distribution. Other distributions will be apparent to those skilled in 
the art. 

In some embodiments of the invention, the length of the inter- 
measurement packet transmission times is provided by a table. 

In some embodiments of the invention, the length of the inter- 
30 measurement packet transmission times is controlled by a scheduler. In some 
embodiments of the invention, the scheduler uses a priority queue, keyed on 
desired send time. 



Other mechanisms to specify the inter-measurement packet transmission 
time will be apparent to those skilled in the art. 

Other packet sizes and transmission times will be apparent to those 
skilled in the art. 
Path Selection 

It is possible that multiple alternative paths between a sender and a 
recover are available through an internetwork at any given moment 
Performance characteristics of each of these paths can be used to select a subset 
of the paths. 

In some embodiments of the invention, the subset of the plurality of 
paths IS selected based at least in part on at least one of: one or more of 'the 
measurement statistics from the measurement packet and one or more of the 

computed statistics. 

In some embodiments of (he invention, ,he selection of the subset of the 
plurality of paths is based at least partly on the position of paths in a rar*i„. ,„ 
some embodiments of the invention, the ranking is a, least partiy based on'one 
or more of the measurement statistics included as data in the measurement 
packet. In some embodiments of the invention the rankmg is a, leas, P a rtly 
based on the computed statistics of the path. In some embodiments of the 
-ent.on the ranking is implemented by using a comparison function to 
compare the paths. and by ordering the pa[hs .„ , ^ ^ 

embodtments of the invention the ranking is implemented by usin. a 
companson function to compare the paths, and by ordering the paths in'an 
■ncreasmg order. Other ranking technique, wil, be apparent to those skiHed i„ 

the art. i w 



in some embodiments of the invention, the ranking is based on a single 
score associated t0 each path Ifl some embodjments Qf ^ 

is denoted Magic Score (MS), and it is computed as follows- 



MS = ML * MF 
ML = d + M * v 
MF = del I a * p-hat + J 
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where ML is the Magic Latency, a component of the MS obtained using delay 
and jitter respectively calculated with statistics; and MF is the Magic scaling 
Factor that multiplies the value of ML, and is computed based on loss statistics. 
M is a constant that takes several values, including 4, for example. MS can be 

5 seen as a scaled-up version of ML, and the scaling factor MF is a function of p- 
hat and delta, a constant. As p-hat not only reflects loss but also detects large 
delay spikes before they happen, p-hat can be seen as an indicator of the 
departure of the path from a "normal mode" operation., and thus the scaling 
' factor is only applied when there are loss or spikes. The goal of MF is to 

1 0 differentiate between paths that have very similar delay characteristics, but with 
one having losses and the other not having them. 

In some embodiments of the invention, ML is used as a delay indicator, 
given that jitter is accounted as an increase in delay. In contrast, MS, although a 
scaled version of ML, cannot be used to indicate delay, except when MF = 1 (p- 
1 5 hat = 0), which leads to MS = ML. That means the value of MS is useful not by 
itself but to compare it with the MSs of other tunnels. 

In some embodiments of the invention, loss statistics can be used as a 
" discriminator instead of a scaling factor. That is, p-hat can eliminate paths 

experimenting loss. Then, the remaining paths can be selected using MS = ML. 

20 In some embodiments of the invention, the selection of a subset of paths 

is based on applying at least one or more thresholds to at least one of more of 
the statistics. 

In some embodiments of the invention, a single threshold is used, and 
computed as a certain percentage of the highest score of the paths. In some 
25 embodiments of the invention, the threshold is determined by subtracting a 
fixed quantity to the highest score of the paths. 

In some embodiments of the invention, the number of paths in the subset 
of paths is fixed. In some embodiments of the invention, this fixed number of 
paths N out of M paths is determined such that the probability of having loss in 
30 (M - N) paths simultaneously is less than a certain threshold. In some 
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15 



embodiments of the invention, this probability is a binomial, with the 

assumption that all paths have the same probability of loss. 

In some embodiments of the invention, the selecuon of the subset of the 
Plnrahty of paths is based at ,eas, partly on a probability associated with eaoh 
path. In some embodiments of the invention, the probability of each path is at 
leas, partly based on one or more of the measurement statistics included as data 
in the measurement packet. 

In some embodiments of the invention, the probabilities of each path are 



equal. 



In some embodiments of the invention, the selection of the subset of the 
Plnrahty of paths is based a, leas, partly on the cos, of the path 

In some embodiments of the invention, the selection of the subset of the 
pHnaltty of paths is based a, leas, partly on the amount of bandwidth consumed 
over a period of time. 

those ,T,' POSSibili ' ieS " COmPU ' e ^ Pr0babi " fa WU ' * I*-* <° 

those skilled in the art. 

Other m echanisms to select a subset of the paths will be apparent to 
those skilled in the art. 
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CLAIMS 

What is claimed is: 

1 . A method for characterizing a quality of a network path, including a first 
segment and a second segment, the method comprising: 

accessing a first metric and a second metric, 

wherein the first metric and the second metric are at least in part 
quality characterizations of a same plurality of one or more network 
applications, 

the quality characterization characterizes a quality of the same 
plurality of one or more network applications running at one or more 
segment end-points, 

the first metric and the second metric are at least partly a 
function of a same plurality of one or more elementary network 
parameters. 

the plurality of one or more network parameters include one or 
more of delay, jitter, loss, currently available bandwidth, and intrinsic 
bandwidth, 

the first metric is at least partly the function of the same plurality 
of elementary network parameters of the first segment, 

the one or more segment end points include one or more end- 
points of the first segment, 

the second metric is at least partly the function of the same 
plurality of elementary network parameters of the second segment, and 

the one or more segment end points include one or more end- 
points of the second segment; and 

adding the first metric and the second metric to generate a third metric, 

wherein the third metric is at least partly the function of the same 
plurality of one or more elementary network parameters of the network 
path, 

the one or more segment end points include one or more end- 
points of the network path, and 



the third metric is a quality characterization of the same plurality 
of one or more applications. 

2. The method of 1 , further comprising: 

prior to accessing the first or the second metric, generating ar least one 
of the first metric and the second metric, 

3. The method of 1, further comprising: 

prior to accessing the first or the second metric, receiving at least one of 
the first metric and the second metric. 

4. The method of claim 1, wherein at least one of the plurality of one or 
more network parameters is dynamic. 

5. The method of claim 1, wherein at least one of the plurality of one or 
more network parameters is static. 

6. The method of claim 1 , wherein the plurality of one or more network 
applications include at least one of UDP and TCP applications. 

7. The method of claim 6, wherein the plurality of one or more network 
applications include UDP applications. 

8. The method of claim 7, wherein the plurality of one or more network 
applications include voice. 

9. The method of claim 7, wherein the plurality of one or more network 
applications include video. 

10. The method of claim 9, wherein the plurality of one or more network- 
applications include video conferencing. 

1 1 ■ The method of claim 6, wherein the plurality of one or more network 
applications include TCP applications. 
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12. The method of claim 1 1 , wherein the plurality of one or more network 
applications include HTTP. 

] 3. The method of claim 12, wherein the plurality of one or more network 
5 applications include HTTP/1 .0. 

1 4. The method of claim 1 2, wherein the plurality of one or more network 
applications include HTTP/1 . 1 . 

10 15. The method of claim 1 1 . wherein the plurality of one or more network 
applications include ftp. 

16. The method of claim 1 1, wherein the plurality of one or more network 
applications include telnet. 

15 

1 7. The method of claim L wherein the plurality of one or more network 
parameters include delay. 

1 S. The method of claim 1 ? wherein th© plurality of one or more network 
20 parameters include jitter. 

1 9. The method of claim 1 , wherein the plurality of one or more network 
parameters include loss. 

25 20. The method of claim 1 , wherein the plurality of one or more network 
parameters include currently available bandwidth. 

2 1 . The method of claim 1, wherein the plurality of one or more network 
parameters include intrinsic bandwidth. 

30 

22. The method of claim 1 ? wherein the metric includes non-performance 
related characteristics. 



•r -7 



23. The method of claim 21, wherein the non-performance related 
characteristics includes pre-specified route prefere 



rences. 



24. A network system, comprising: 

a plurality of one or more network devices configured, such that if the 
network device is coupled to at least a network path including a first segment 
and a second segment, the plurality of one or more network devices performs 
accessing a first metric and a second metric, 

wherein the first metric and the second metric are at least in part 
quality characterizations of a same plurality of 0ne or more network 
applications, 

the quality characterization characterizes a quality of the same 
plurality of one or more network applications running at one or more 
segment end-points, 

the first metric and the second metric are at least partly a 
function of a same plurality of one or more elementary network 
parameters, 

the plurality of one or more network parameters include 
more of delay, jitter, loss, currently available bandwidth, and intri 

bandwidth, 

the first metric is at least partly the function of the same plurality 
of elementary network parameters of the first segment, 

the one or more segment end points include one or more end- 
points of the first segment, 

the second metric is at least partly the function of the same 

plurality of elementary network parameter* nf f i« 

^Pa'ameteis ot the second segment and 

the one or more segment end points include one or more end- 
points of the second segment; and 

adding the first metric and the second metric to generate a third metric 
wherein the third metric is at least partly the function of the _ 
Plurality of one or more elementary network parameters of the network 



one or 
nsic 



same 



path, 
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the one or more segment end points include one or more end- 
points of the network path, and 

the third metric is a quality characterization of the same plurality 
of one or more applications. 

25. The network system of 24 a wherein the network device further performs: 
prior to accessing the first or the second metric, generating at least one 

of the first metric and the second metric. 

26. The network system of 24 5 wherein the network device further performs: 
prior to accessing the first or the second metric, receiving at least one of the first 
metric and the second metric. 

27. The network system of 24, wherein at least one of the plurality of one or 
more network parameters is dynamic. 

28. The network system of 24, wherein at least one of the plurality of one or 
more network parameters is static. 

0 29. The network system of 24, wherein the plurality of one or more network 
applications include at least one of UDP and TCP applications. 

30. The network system of 29, wherein the plurality of one or more network 
applications include UDP applications. 

5 

3 1 . The network system of 30, wherein the plurality of one or more network 
applications include voice. 

32. The network system of 30, wherein the plurality of one orrnore network 
0 applications include video. 

33. The network system of 32, wherein the plurality of one or more network 
applications include video conferencing. 



34. The network system of 29, wherein the plurality of one or more network 
applications include TCP applications. 



35. The network system of 34, wherein the plurality of one or more network 
applications include HTTP. 



36. The network system of 35, wherein the plurahty of one or more network 

applications include HTTP/1.0. 



37. The network system of 35, wherein the plurality of one or more network 
applications include HTTP/1.1. 



38. The network system of 34. wherein the plurality of 0 „e or more ne(work 

applications include ftp. 



39 The network system of 34, wherein the pl „ rality of one or more network 

applications include telnet 



40. The network system of 24, wherein the plurality of one or more network 

parameters include delay. 



41 . The network system of 24, wherein the plurality of one or more network 

parameters include jitter. 



42. The network system of 24, wherein the plurality of one or more netWQrk 
parameters include loss. 



43- The network system of 24, wherein the plura.ity of one or more network 
parameters include currently available bandwidth. 

44. The network system of 24, wherein the plurah ty of one or more network 
parameters include intrinsic bandwidth. 
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"45. The network system of 24, wherein the metric includes non-performance 
related characteristics. 

46. The network system of claim 45, wherein the non-performance related 
5 characteristics includes pre-specified route preferences. 

47. The network system of 24 ; further comprising: 

a plurality of one or more inputs adapted to be coupled to the network 

path; and 

! 0 a plurality of one or more outputs coupled to the plurality of one or more 

inputs, 

' wherein responsive to a plurality of one or more packets arriving to the 
network device through the plurality of one or more inputs, the network device 
selects at least one output from the plurality of one or more outputs, and 
! 5 the at least one output is determined at least partly using at least one of 

the first metric, second metric, and third metric. 
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